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© Speech coding method and system. 



© A speech coding method which can code a speech signal at a bit rate of 8 kb/s or less by a comparatively 
small amount of calculation to obtain a good sound quality. An autocorrelation of a synthesis signal synthesized 
from a codevector of an excitation codebook (140) and a linear predictive parameter of an input speech signal is 
corrected using an autocorrelation of a synthesis signal synthesized from a codevector of an adaptive codebook 
(120) and a linear predictive parameter and a cross-correlation between the synthesis signal of the codevector of 
the adaptive codebook (120) and the synthesis signal of the codevector of the excitation codebook (140). A gain 
codebook (210) is searched using the corrected autocorrelation and a cross-correlation between a signal 
obtained by subtraction of the synthesis signal of the codevector of the adaptive codebook (120) from the input 
speech signal and the synthesis signal of the codevector of the excitation codebook (1 40). 
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This invention relates to a speech coding method and system for coding a speech signal with high 
quality by a comparatively small amount of calculations at a low bit rate, specifically, at about 8 kb/s or less. 

A CELP speech coding method is known as a method of coding a speech signal with high efficiency at 
a bit rate of 8 kb/s or less. Such CELP method employs a linear predictive analyser representing a short- 

s term correlation of a speech signal, an adaptive codebook representing a long-term prediction of a speech 
signal, an excitation codebook representing an excitation signal, and a gain codebook representing gains of 
the adaptive codebook and excitation codebook. 

It is already known that, with such CELP method, a better excitation codevector can be searched out to 
achieve an improved sound quality by using, when the excitation codebook is to be searched, simultaneous 

70 optimal gains as the gain of the adaptive codevector and the gain of the excitation codevector. Such speech 
coding method which uses, when the excitation codebook is to be searched, simultaneous optimal gains as 
the gain of the adaptive codevector and the gain of *he excitation codevector is disclosed, for example, in 
Ira A. Gerson and Mark A. Jasiuk, "VECTOR SUM EXCITED LINEAR PREDICTION (VSELP) SPEECH 
CODING AT 8 KBPS", Proc. ICASSP. '90 S9.3. pp.46 1 -464, 1990 (reference 1) and in M. Tomohiko and M. 

75 Johnson. "Pitch Orthogonal CELP Speech Coder", Lecture Thesis Collection I of '90 Autumnal Re- 
search Publication Meeting, Acoustical Society of Japan, pp. 189- 190, 1990 (reference 2). 

Meanwhile, as a speech coding method which codes a speech signal with high efficiency at a bit rate of 
8 kb/s or less, such CELP method which employs a hnear predictive analyzer representing a short-term 
correlation of a speech signal, an adaptive codebook representing a long-term prediction of a speed signal, 

20 an excitation codebook representing an excitation signal and a gain coaebook representing gains of the 
adaptive codebook and excitation codebook as described hereinabove is- disclosed in Manfred R. Schroeder 
and Bishnu S. Atal, "CODE-EXCITED LINEAR PREDICTION (CELP): HIGH-QUALITY SPEECH AT VERY 
LOW BIT RATES", Proc. ICASSP, pp.937-940, 1985 (reference 3). 

According to the conventional speech coding methods disclosed in reference 1 and reference 2, the 

25 excitation codebook has a specific algebraic structure, and consequently, simultaneous optimal gains of the 
adaptive codevector and excitation codevector can be calculated by a comparatively small amount of 
calculation. However, an excitation codebook which does not have such specific algebraic structure has a 
drawback that a great amount of calculation is required for the calculation of simultaneous optimal gains. 



Meanwhile, according to the conventional speech coding method disclosed in reference 3, gains are not 



30 normalized, and consequently, a dispersion of gains is great, which makes the quantization characteristic of 
the speech coding system low. 

It is an object of the present invention to provide a speech coding system which can code a speech 
signal at a bit rate of 8 kb/s or less by a comparatively smalt amount of calculation to obtain a good sound 
quality. 

35 In order to attain the object, according to an aspect of the present invention, there is provided a speech 

coding method for coding an input speech signal using a linear predictive analyzer for receiving such input 
speech signal divided into frames of a fixed interval and finding a linear predictive parameter of the input 
speech signal, an adaptive codebook which makes use of a long-term correlation of the input speech signal, 
an excitation codebook representing an excitation signal of the input speech signal, and a gain codebook for 

40 quantizing a gain of the adaptive codebook and a gain of the excitation codebook, which method comprises 
at least the steps of: 

correcting an autocorrelation of a synthesis signal synthesized from a codevector of the excitation 
codebook and the linear predictive parameter using an autocorrelation of a synthesis signal synthesized 
from a codevector of the adaptive codebook and the linear predictive parameter and a cross-correlation 

45 between the synthesis signal of the codevector of the adaptive codebook and the synthesis signal of the 
codevector of the excitation codebook; and 

searching the gain codebook using the corrected autocorrelation and a cross-correlation between a 
signal obtained by subtraction of the synthesis signal of the codevector of the adaptive codebook from the 
input speech signal and the synthesis signal of the codevector of the excitation codebook. 

so In the speech coding method, the adaptive codebook is searched for an adaptive codevector which 
minimizes the following error C: 



N - 1 



55 



C = Z {xw* (n) - (n) }2 



(1) 



n ■ 0 
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for 

£ = < X w\ xw'X^xw'.Sad) (2) 

5 where xw' is a signal obtained by subtraction of an influence signal from an input perceptually weighted 
signal, Sa^ is a perceptually weighted synthesized signal of an adaptive codevector of a delay d. 0 is an 
optimal gain of the adaptive codevector, N is a length of a subframe, and <", "> is an inner product. 

Subsequently, the excitation codebook is searched for an excitation codevector which minimizes, for the 
selected adaptive codevector a<j, the following error D: 

10 

N - 1 

D = 2 {xa(n) - ySCi'(n)} 2 (3) 

n = 8 

>5 

for 

y - <xa, xa>/<xa, ScT) (4) 

20 xa(n) = xw'(n) - tfSa^n) (5) 

where ScC is a perceptually weighted synthesized signal of an excitation codevector c t of an index i 
orthogonalized with respect to the perceptually weighted synthesized signal of the selected adaptive 
codevector, and y is an optimal gain of the excitation codevector. 

25 While a method of orthogonalizing a perceptually weighted synthesized signal of an excitation codevec- 

tor c ( of an index i with respect to a perceptually weighted synthesized signal of a selected adaptive 
codevector in order to find simultaneous optimal gains is already known, for example, from reference 1 
mentioned hereinabove, the method requires a very large amount of calculation. Thus, such amount of 
calculation is reduced by calculating an excitation codevector D in the following manner. 

30 First, the equation (4) is substituted into the equation (3): 

D = <xa,xa> - <xa, Sc^Sd', Sq') (6) 

Then, the following equation (7) is substituted into the equation (6). and then since xa and Sa<j are 
35 orthogonal to each other, the equation (8) is obtained: 

Sci f = Sci - Saa * <Sad, SCi>/<Sad, Sa<,> (7) 

D = <xa, xa> - <xa, Sc } > 2 / {<Sc if Sc t > - (Sa^ Sc^Sa,,, Sa^)} (8) 

40 

Finally, the gain codebook is searched for a gain codevector which minimizes, for the selected adaptive 
codevector and excitation codevector, the following error E: 

N - 1 

45 E = 2 {xw'(n) - 0jSad(n) - riSci(n)} 2 (9) 

n =* a 

where (fy , >j) is a gain codevector of an index j. 

so The gain codebook may be a signal two-dimensional codebook consisting of gains of the adaptive 
codebook and gains of the excitation codebook or else may consist of two codebooks including a one- 
dimensional gain codebook consisting of gains of the adaptive codebook and another one-dimensional gain 
codebook consisting of gains of the excitation codebook. 

Thus, the speech coding method is characterized in that, when the excitation codebook is to be 

55 searched using an optimal gain as gains of an adaptive codevector and an excitation codevector, the 
equation (7) is not calculated directly, but the equation (8) based on correlation calculation is used. 

Now, if the length of a subframe is N and the excitation codebook has a size of B bits, then the 
equation (7) requires N'2 B times of calculating operations because Sa^ is multiplied by <Sad, SqMSad, Sad>, 
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but the equation (8) requires an N times of calculating operations for the calculation of (Sa^, Sc^KSa^, Sa^). 
Consequently, calculating operations can be reduced by N{2 B -1) times. Besides, a similarly high sound 
quality can be attained. 

According to another aspect of the present invention, there is provided a speech coding method for 
5 coding an input speech signal using a linear predictive analyzer for receiving such input speech signal 
divided into frames of a fixed interval and finding a spectrum parameter of the input speech signal, an 
adaptive codebook which makes use of a long-term correlation of the input speech signal, an excitation 
codebook representing an excitation signal of the input speech signal, and a gain codebook for quantizing a 
gain of the adaptive codebook and a gain of the excitation codebook, which method comprises at least the 
w step of: 

searching the gain codebook for a codevector using a normalization coefficient which is calculated from 
an autocorrelation of a synthesis signal of an adaptive codevector from the adaptive codebook, a cross- 
correlation between a synthesis signal of the adaptive codevector and the synthesis signal of the excitation 
codevector, an autocorrelation of the synthesis signal of the excitation codevector, and an autocorrelation of 
;s the input speech signal or an estimated value of such autocorrelation of the input speech signal. 

In the speech coding method, the adaptive codebook is searched for an adaptive codevector which 

minimizes the following error C: 

n - 1 

20 C = £ {xw*(n) - 0Sa d (n)}* (10) 

n = 0 

for 

25 

0 = < xw \ xw*>/<xw\ Sa^ (11) 

where xw' is a signal obtained by subtraction of an influence signal from an input perceptually weighted 
signal, Sad is a perceptually weighted synthesized signal of an adaptive codevector of a delay d, & is an 
30 optimal gain of the adaptive codevector, N is a length of a subframe (for example, 5 ms), and <","> is an 
inner product. 

Subsequently, the excitation codebook is searched for an excitation codevector which minimizes, for the 
selected adaptive codevector ad, the following error D: 

35 

N - i 

D = 2 {xa(n) - ySCi(n)}2 (12) 

n « 0 

40 for 

y = <xa, xa)/<xa, Sq) (13) 
xa(n) = xw'(n) - jSSa^n) (14) 

45 

where Sci is a perceptually weighted synthesized signal of an excitation codevector c f of an index i, and y is 
an optimal gain of the excitation codevector. Sq may be a perceptually weighted synthesized signal of an 
excitation codevector q of an index i orthogonalized with respect to a perceptually weighted synthesized 
signal of the selected adaptive codevector. 
so Finally, the gain codebook is searched for a gain codevector which minimizes, for the selected adaptive 
codevector and excitation codevector, the following error E of the equation (15). The gain codebook here 
need not be a two-dimensional codebook. For example, the gain codebook may consist of two codebooks 
including a one-dimensional gain codebook for the quantization of gains of the adaptive codebook and 
another one-dimensional gain codebook for the quantization of gains of the excitation codebook. 

55 
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N - 1 

E - £ {xw'(n) - 0iSa d (n) - rj SCi(n)} 2 (15) 

n > 9 

5 

for 

= Gjj • XRMS/ARMS - -yj * <Sad, SCiMSa^,, Sa<,> (16) 
w n = G 2j * XRMS/CRMS (17) 
ARMS = ((Saa, Sa^/N) 172 (18) 

CRMS = {(Sq . sci> - (S^, SCj> 2 /<Sad. Sad»/N} 1/2 (19) 

15 

where XRMS is a quantized RMS of a weighted speech signal for one frame (for example, 20 ms), and (Gij, 
G2j) is a gain codevector of an index j. 

While XRMS is a quantized RMS of a weighted speech signal for one frame, a value obtained by 
interpolation (for example, logarithmic interpolation) into each subframe using a quantized RMS of a 
20 weighted speech signal of a preceding frame may be used instead. 

The speech coding method is thus characterized in that normalized gains are used as a gain codebook. 
Since a dispersion of gains is decreased by the normalization, the gain codebook having the normalized 
gains as codevectors has a superior quantizing characteristic, and as a result, coded speech of a high 
quality can be obtained. 

25 The above and other objects, features and advantages of the present invention will become apparent 

from the following description and the appended claims, taken in conjunction with the accompanying 
drawings in which like parts or elements are denoted by like reference characters. 

FIG. 1 is a block diagram showing a coder which is used in putting a speech coding method according 
to the present invention into practice: 
30 FIG. 2 is a block diagram showing a decoder which is used in putting the speed coding method 
according to the present invention into practice; 

FIG. 3 is a block diagram showing another coder which is used in putting the speed coding method 
according to the present invention into practice; 

FIG. 4 is a block diagram showing another decoder which is used in putting the speed coding method 

35 according to the present invention into practice; and 

FIG. 5 is a block diagram showing a gain calculating circuit of the decoder shown in FIG. 4. 
Referring first to FIG. 1, there is shown a coder which is used in putting a speech coding method 
according to the present invention into practice. The coder receives an input speech signal by way of an 
input terminal 100. The input speech signal is supplied to a linear predictor 110. an adaptive codebook 

40 search circuit 130 and a gain codebook search circuit 220. The linear predictor 110 performs a linear 
predictive analysis of the speech signal divided into frames of a fixed length (for example, 20 ms) and 
outputs a spectrum parameter to a weighting synthesis filter 150, the adaptive codebook search circuit 130 
and the gain codebook search circuit 220. Then, the following processing is performed for each of 
subframes (for example. 5 ms) into which each frame is further divided. 

45 In particular, adaptive codevectors of delays d are outputted from the adaptive codebook 120 to the 
adaptive codebook search circuit 130, at which searching for an adaptive codevector is performed. From the 
adaptive codebook search circuit 130, a selected delay d is outputted to a multiplexer 230; the adaptive 
codevector of the selected delay d is outputted to the gain codebook search circuit 220; a weighted 
synthesis signal Sa^ of the adaptive codevector ad of the selected delay d is outputted to a cross-correlation 

so circuit 160; an autocorrelation (Sad. Sad) of the weighted synthesis signal Sad of the adaptive codevector ad 
of the selected delay d is outputted to an orthogonalization cross-correlation circuit 190; and a signal xa 
obtained by subtraction from the input speech signal of a signal obtained by multiplication of the weighted 
synthesis signal Sad of the adaptive codevector ad of the selected delay d by an optimal gain £ is outputted 
to another cross-correlation circuit 180. 

55 An excitation codebook 140 outputs excitation codevectors q of indices i to the weighting synthesis 

filter 150 and a (cross-correlation^autocorrelation) maximum value search circuit 200. The weighting 
synthesis filter 150 weighted synthesizes the excitation codevectors Cj and outputs them to the cross- 
correlation circuit 160, an autocorrelation circuit 170 and the cross-correlation circuit 180. The cross- 
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correlation circuit 160 calculates cross-correlations between the weighted synthesis signal Sa^ of the 
adaptive codevector ad and weighted synthesis signals Sc t of the excitation codevector Cj and outputs them 
to the orthogonalization autocorrelation circuit 190. The autocorrelation circuit 170 calculates autocor- 
relations of the weighted synthesis signals Sc* of the excitation codevectors q and outputs them to the 

5 orthogonalization autocorrelation circuit 190. The cross-correlation circuit 180 calculates cross-correlations 
between the signal xa and the weighted synthesis signal Sc ( of the excitation codevector c ( and outputs 
them to the (cross-correlation^autocorrelation) maximum value search circuit 200. 

The orthogonalization autocorrelation circuit 190 calculates autocorrelations of weighted synthesis 
signals Sc/ of the excitation codevectors q which are orthogonalized with respect to the weighted synthesis 

w signal Sa^j of the adaptive codevector a^, and outputs them to the (cross-correlation^/fautocorrelation) 
maximum value search circuit 200. The (cross-correlation) 2 /(autocorrelation) maximum value search circuit 
200 searches for an index i with which the (cross-correlation between the signal xa and the weighted 
synthesis signal Sq' of the excitation codevector C; orthogonalized with respect to the weighted synthesis 
signal Sa<j of the adaptive codevector ad) 2 /(autocorrelation of the weighted synthesis signal Sc; 1 of the 

15 excitation codevector Cj orthogonalized with respect to the weighted synthesis signal Sa^j of the adaptive 
codevector a^) presents a maximum value, and the index i thus searched out is outputted to the multiplexer 
230 while the excitation codevector d is outputted to the gain codebook search circuit 220. Gain 
codevectors of the indices j are outputted from a gain codebook 210 to the gain codebook search circuit 
220. The gain codebook search circuit 220 searches for a gain codevector and outputs the index j of the 

20 selected gain codevector to the multiplexer 230. 

Referring now to FIG. 2, there is shown a decoder which is used in putting the speed coding method 
according to the present invention into practice. The decoder includes a demultiplexer 240, from which a 
delay d for an adaptive codebook is outputted to an adaptive codebook 250; a spectrum parameter is 
outputted to a synthesis filter 310; an index i for an excitation codebook is outputted to an excitation 

25 codebook 260; and an index j for a gain codebook is outputted to a gain codebook 270. An adaptive 
codevector a<j of the delay d is outputted from the adaptive codebook 250; an excitation codevector C\ of the 
index i is outputted from the excitation codebook 260; and gain codevector (0j -yj) of the index j are 
outputted from the gain codebook 270. The adaptive codevector a<j and the gain codevector £j are 
multiplied by a multiplier 280 while the excitation codevector q and the gain codevector 7j are multiplied by 

30 another multiplier 290, and the two products are added by an adder 300. The sum thus obtained is 
outputted to the adaptive codebook 250 and the synthesis filter 310. The synthesis filter 310 synthesizes 
• £j + Cj * yi and outputs it by way of an output terminal 320. 

The gain codebook may be a single two-dimensional codebook consisting of gains for an adaptive 
codebook and gains for an excitation codebook or may consist of two codebooks including a one- 

35 dimensional gain codebook consisting of gains for an adaptive codebook and another one-dimensional gain 
codebook consisting of gains for an excitation codebook. 

When <xa, SCj> of the equation (8) given hereinabove is to be calculated by the cross-correlation circuit 
180, it may alternatively be calculated in accordance with the following equation in order to reduce the 
amount of calculation: 

40 

H - 1 

<xa, Sci> = £ p(k) ci (k) (20) 

k » 0 

45 

for 

N - 1 

p(k) = Z xa(n) h(n-k) (21) 

50 n = k 



where h is an impulse response of the weighted synthesis filter. 

Meanwhile, when <Sad. SCj> of the equation is to be calculated by the cross-correlation circuit 160, it 
55 may alternatively be calculated in accordance with the following equation in order to reduce the amount of 
calculation: 
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N - I 

<Sa d , Sci> = S q(k) Ci (k) (22) 
k = e 



5 

for 



N - 1 

>o q(k) = 2 Sa d (n) h(n-k) (23) 

n = k 

On the other hand, when <Sc it SCj> of the equation (8) is to be calculated by the autocorrelation circuit 
;s 170, alternatively it may be calculated approximately in accordance with the following equation in order to 
reduce the amount of calculation; 

N - l 

<Sci . Sci > = u(0) v(0) + 22 ti(m) v(m) (24) 
2 ° «.» i 

for 

25 

N - n - 1 

*i(m) = 2 h(n) h(n+m) (25) 

n » 0 



30 

H - n - 1 

v(m) = 2 c(n) c(n+m) (26) 
n « a 

35 

In the meantime, in order to improve the performance, a combination of a delay and an excitation which 
minimizes the error between a weighted input signal and a weighted synthesis signai may be found after a 
plurality of candidates are found for each delay d from within the adaptive codebook and then excitation of 

40 the excitation codebook are orthogonalized with respect to the individual candidates. In this instance, when 
<Sad, Scj) of the equation (8) is to be calculated by the cross-correlation circuit 160, it may otherwise be 
calculated in accordance with the following equation (27) in order to reduce the amount of calculation. In 
this case, however, instead of inputting Sa<j to the cross-correlation circuit 160, xa and an optimal gain 0 of 
an adaptive codevector are inputted from the adaptive codebook search circuit 130 and <xa, Sq> are 

45 inputted from the cross-correlation circuit 180 to the cross-correlation circuit 160. 

<Sad, SCj> = (<xw\ SCi> - <xa, Sc;) yp (27) 

The calculation of (Sad, Sc ( > in accordance with the equation (27) above eliminates the necessity of 
so calculation of an inner product which is performed otherwise each time the adaptive codebook changes, and 

consequently, the total amount of calculation can be reduced. 

Further, in order to further improve the performance, a combination of a delay of the adaptive codebook 

and an excitation of the excitation codebook need not be determined decisively for each subframe, but may 

otherwise be determined such that a plurality of candidates are found for each subframe, and then an 
55 accumulated error power is found for the entire frame, whereafter a combination of a delay of the adaptive 

codebook and an excitation of the excitation codebook which minimizes the accumulate error power is 

found. 

Referring now to FIG. 3, there is shown another coder which is used in putting the speech coding 
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method according to the present invention into practice. The coder receives an input speech signal by way 
of an input terminal 400. The input speech signal is supplied to a weighting filter 405 and a linear predictive 
analyzer 420. The linear predictive analyzer 420 performs a linear predictive analysis and outputs a 
spectrum parameter to the weighting filter 405, an influence signal subtracting circuit 415, a weighting 
5 synthesis filter 540, an adaptive codebook search circuit 460, an excitation codebook search circuit 480, and 
a multiplexer 560. 

The weighting filter 405 perceptually weights the speech signal and outputs it to a subframe dividing 
circuit 410 and an autocorrelation circuit 430. The subframe dividing circuit 410 divides the perceptually 
weighted speech signal from the weighting filter 405 into subframes of a predetermined length (for example, 

70 5 ms) and outputs the weighted speech signal of subframes to the influence signal subtracting circuit 415, 
at which an influence signal from a preceding subframe is subtracted from the weighted speech signal. The 
influence signal subtracting circuit 415 thus outputs the weighted speech signal, from which the influence 
signal has been subtracted, to the adaptive code book search circuit 460 and a subtractor 545. Meanwhile, 
adaptive codevectors a<j of delays d are outputted from the adaptive codebook 450 to the adaptive 

15 codebook search circuit 460, by which the adaptive codebook 450 is searched for an adaptive codevector. 
From the adaptive codebook search circuit 460, a selected delay d is outputted to the multiplexer 560; the 
adaptive codevector ad of the selected delay d is outputted to a multiplier 522; a weighted synthesis signal 
Saa of the adaptive codevector ad of the selected delay d is outputted to an autocorrelation circuit 490 and 
a cross-correlation circuit 500; and a signal xa obtained by subtraction from the weighted speech signal of a 

20 signal obtained by multiplication of the weighted synthesis signal Sad ot the adaptive codevector ad of the 
selected delay d by an optimal gain 0 is outputted to the excitation codebook search circuit 480. 

The excitation codebook search circuit 480 searches the excitation codebook 470 and outputs an index 
of a selected excitation codevector to the multiplexer 560, the selected excitation codevector to a multiplier 
524, and a weighted synthesis signal of the selected excitation codevector to the cross-correlation circuit 

25 500 and an autocorrelation circuit 510. In this instance, a search may be performed after orthogonalization 
of the excitation codevector with respect to the adaptive codevector. 

The autocorrelation circuit 430 calculates an autocorrelation of the weighted speech signal of the frame 
length and outputs it to a quantizer for RMS of input speech signal 440. The quantizer for RMS of input 
speech signal 440 calculates an RMS of the weighted speech signal of the frame length from the 

30 autocorrelation of the weighted speech signal of the frame length and u-law quantizes it, and then outputs 
the index to the multiplexer 560 and the quantized RMS of input speech signal to a gain calculating circuit 
520. The autocorrelation circuit 490 calculates an autocorrelation of the weighted synthesis signal of the 
adaptive codevector and outputs it to the gain calculating circuit 520. The cross-correlation circuit 500 
calculates a cross-correlation between the weighted synthesis signal of the adaptive codevector and the 

35 weighted synthesis signal of the excitation codevector and outputs it to the gain calculating circuit 520. The 
autocorrelation circuit 510 calculates an autocorrelation of the weighted synthesis signal of the excitation 
codevector and outputs it to the gain calculating circuit 520. 

Gain codevectors of the indices j are outputted from a gain codebook 530 to the gain calculating circuit 
520, at which gains are calculated. Thus, a gain of the adaptive codevector is outputted from the gain 

40 calculating circuit 520 to the multiplier 522 while another gain of the excitation codevector is outputted to 
the multiplier 524. The multiplier 522 multiples the adaptive codevector from the adaptive codebook search 
circuit 460 by the gain of the adaptive codevector while the multiplier 524 multiplies the excitation 
codevector from the excitation codebook search circuit 480 by the gain of the excitation codevector, and the 
two products are added by an adder 526 and the sum thus obtained is outputted to the weighting synthesis 

45 filter 540. The weighting synthesis filter 540 weighted synthesizes the sum signal from the adder 526 and 
outputs the synthesis signal to the subtractor 545. The subtractor 545 subtracts the output signal of the 
weighting synthesis filter 540 from the speech signal of the subframe length from the influence signal 
subtracting circuit 415 and outputs the difference signal to a squared error calculating circuit 550. The 
squared error calculating circuit 550 searches a gain codevector which minimizes the squared error, and 

so outputs an index of the gain codevector to the multiplexer 560. 

When a gain is to be calculated by the gain calculating circuit 520, instead of using a quantized RMS of 
input speech signal itself, another value may be employed which is obtained by interpolation (for example, 
logarithmic interpolation) into each subframe using a quantized RMS of input speech signal of a preceding 
frame and another quantized RMS of input speech signal of a current frame. 

55 Referring now to FIG. 4, there is shown another decoder which is used in putting the speech coding 

method according to the present invention into practice. The decoder includes a demultiplexer 570, from 
which an index of a RMS of input speech signal is outputted to a decoder for RMS of input speech signal 
580; a delay of an adaptive codevector is outputted to an adaptive codebook 590; an index to an excitation 
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codevector is outputted to an excitation codebook 600; an index to a gain codevector is outputted to a gain 
codebook 610; and a spectrum parameter is outputted to a weighting synthesis filter 620, another weighting 
synthesis filter 630 and a synthesis filter 710. 

The RMS of input speech signal is outputted from the decoder for RMS of input speech signal 580 to a 

5 gain calculating circuit 670. The adaptive codevector is outputted from the adaptive codebook 590 to the 
synthesis filter 620 and a multiplier 680. The excitation codevector is outputted from the excitation 
codebook 600 to the weighting synthesis filter 630 and a multiplier 690. The gain codevector is outputted 
from the gain codebook 610 to the gain calculating circuit 670. The weighted synthesis signal of the 
adaptive codevector is outputted from the weighting synthesis filter 620 to an autocorrelation circuit 640 and 

70 a cross-correlation circuit 650 while the weighted synthesis signal of the excitation codevector is outputted 
from the weighting synthetic filter 630 to another autocorrelation circuit 660 and the cross-correlation circuit 
650. 

The autocorrelation circuit 640 calculates an autocorrelation of the weighted synthesis s>gcai of the 
adaptive codevector and outputs it to the gain calculating circuit 670. The cross-correlation circuit 650 

is calculates a cross-correlation between the weighted synthesis signal of the adaptive code veer jr and the 
weighted synthesis signal of the excitation codevector and outputs it to the gain calculating circuit 670. The 
cross-correlation circuit 660 calculates an autocorrelation of the weighted synthesis signal of the excitation 
codevector and outputs it to the gain calculating circuit 670. 

The gain calculating circuit 670 calculates a gain of the adaptive codevector and a gain of the excitation 

20 codevector using the equations (16) to (19) given hereinabove and outputs the gain of the adaptive 
codevector to the multiplier 680 and the gain of the excitation codevector to the multiplier 690. The 
multiplier 680 multiplies the adaptive codevector from the adaptive codebook 59 by the gain of the adaptive 
codevector while the multiplier 690 multiplies the excitation codevector from the excitation codebook 600 by 
the gain of the excitation codevector, and the two products are added by an adder 700 and outputted to the 

25 synthesis filter 710. The synthesis filter 710 synthesizes such signal and outputs it by way of an output 
terminal 720. 

When a gain is to be calculated by the gain calculating circuit 670. instead of using a quantized RMS of 
input speech signal itself, another value may be employed which is obtained by interpolation (for example, 
logarithmic interpolation) into each subframe using a quantized RMS of input speech signal of a preceding 

30 frame and another quantized RMS of input speech signal of a current frame. 

Referring now to FIG. 5, the gain calculating circuit 670 is shown more in detail. The gain calculating 
circuit 670 receives a quantized RMS of the input speech signal (hereinafter represented as XRMS) by way 
of an input terminal 730. The quantized XRMS of the input speech signal is supplied to a pair of dividers 
850 and 870. An autocorrelation <Sa, Sa) of a weighted synthesis signal of an adaptive codevector is 

35 received by way of another input terminal 740 and supplied to a multiplier 790 and a further divider 800. A 
cross-correlation <Sa, Sc> between the weighted synthesis signal of the adaptive codevector and a weighted 
synthesis signal of an excitation codevector is received by way of a further input terminal 750 and supplied 
to the divider 800 and a multiplier 810. An autocorrelation <Sc, Sc> of the weighted synthesis signal of the 
excitation codevector is received by way of a still further input terminal 760 and transmitted to a subtractor 

40 820. A first component Gi of a gain codevector is received by way of a yet further input terminal 770 and 
transmitted to a multiplier 890. A second component G2 of the gain codevector is inputted by way of a yet 
further input terminal 780 and supplied to a multiplier 880. 

The multiplier 790 multiplies the autocorrelation <Sa, Sa> by 1/N and outputs the product to a root 
calculating circuit 840. which thus calculates a root of <Sa, Sa>/N and outputs it to the divider 850. Here, N is 

45 a length of a subframe (for example, 40 samples). The divider 850 divides the quantized XRMS of the input 
speech signal by (<Sa, Sa>/N) 1/2 and outputs the quotient to the multiplier 890, at which XRMS/(<Sa, Sa>/N) 1/2 
is multiplied by the first component Gi of the gain codevector. The product at the multiplier 890 is 
outputted to the subtractor 900. 

The divider 800 divides the cross-correlation <Sa, Sc> by the autocorrelation <Sa, Sa> and outputs the 

50 quotient to the multipliers 810 and 910. The multiplier 810 multiplies the quotient <Sa, Sc>/<Sa, Sa> by the 
cross-correlation <Sa, Sc> and outputs the product to the subtractor 820. The subtractor 820 subtracts <Sa, 
Sc> 2 /<Sa, Sa> from the autocorrelation <Sc, Sc> and outputs the difference to the multiplier. 830, at which the 
difference is multiplied by 1/N. The product is outputted from the multiplier 830 to the root calculating 
circuit 860. The root calculating circuit 860 calculates a root of the output signal of the multiplier 830 and 

55 outputs it to the divider 870. The divider 870 divides the quantized XRMS of the input speech signal from 
the input terminal 730 by {(<Sc, Sc> - <Sa, Sc^Sa, Sa>)/N} 1/2 and outputs the quotient to the multiplier 800. 
The multiplier 880 multiplies the quotient by the second component G2 of the gain codevector and outputs 
the product to the multiplier 910 and an output terminal 930. The multiplier 910 multiplies the output of the 
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multiplier 880, i.e., G 2 •XRMS/^Sc. Sc> -<Sa, ScWSa. Sa>)/N} 1/2 , by <Sa, Sc>/<Sa, Sa> and outputs the 
product to the subtractor 900. The subtractor 900 subtracts the product from the multiplier 910 from Gi * 
XRMS/(<Sa, Sa>/N) 1/2 and outputs the difference to another output terminal 920. 

The gain codebook described above need not necessarily be a two-dimensional codebook. For 
5 example, the gain codebook may consist of two codebooks including a one-dimensional gain codebook 
consisting of gains for an adaptive codebook and another one-dimensional gain codebook consisting of 
gains for an excitation codebook. 

The excitation codebook may be constituted from a random number signal as disclosed in reference 3 
mentioned hereinabove or may otherwise be constituted by learning in advance using a training data. 
w Having now fully described the invention, it will be apparent to one of ordinary skill in the art that many 
changes and modifications can be made thereto without departing from the spirit and scope of the invention 
as set forth herein. 

Claims 

75 

1. A speech coding method for coding an input speech signal using a linear predictive analyzer for 
receiving such input speech signal divided into frames of a fixed interval and finding a linear predictive 
parameter of the input speech signal, an adaptive codebook which makes use of a long-term correlation 
of the input speech signal, an excitation codebook representing an excitation signal of the input speech 

20 signal, and a gain codebook for quantizing a gain of said adaptive codebook and a gain of said 

excitation codebook, comprising at least the steps of: 

correcting an autocorrelation of a synthesis signal synthesized from a codevector of said excitation 
codebook and the linear predictive parameter using an autocorrelation of a synthesis signal synthesized 
from a codevector of said adaptive codebook and the linear predictive parameter and a cross- 

25 correlation between the synthesis signal of the codevector of said adaptive codebook and the synthesis 

signal of the codevector of said excitation codebook; and 

searching said gain codebook using the corrected autocorrelation and a cross-correlation between 
a signal obtained by subtraction of the synthesis signal of the codevector of said adaptive codebook 
from the input speech signal and the synthesis signal of the codevector of said excitation codebook. 

30 

2. A speech coding method for coding an input speech signal using a linear predictive analyzer for 
receiving such input speech signal divided into frames of a fixed interval and finding a spectrum 
parameter of the input speech signal, an adaptive codebook which makes use of a long-term correlation 
of the input speech signal, an excitation codebook representing an excitation signal of the input speech 

35 signal, and a gain codebook for quantizing a gain of said adaptive codebook and a gain of said 

excitation codebook, comprising at least the step of: 

searching said gain codebook for a codevector using a normalization coefficient which is calculated 

from an autocorrelation of a synthesis signal of an adaptive codevector from said adaptive codebook, a 

cross-correlation between a synthesis signal of the adaptive codevector and the synthesis signal of said 
40 excitation codevector, an autocorrelation of the synthesis signal of the excitation codevector, and an 

autocorrelation of the input speech signal or an estimated value of such autocorrelation of the input 

speech signal. 

3. A speech coding system, comprising: 

4 5 a linear predictive analyzer for receiving an input speech signal divided into frames of a fixed 

interval and finding a linear predictive parameter of the input speech signal; 
an adaptive codebook representing codevectors; 

an adaptive codebook search circuit for searching said adaptive codebook using the linear 
predictive parameter of said linear predictive analyzer; 
so an excitation codebook representing excitation codevectors; 

a gain codebook for quantizing a gain of said adaptive codebook and a gain of said excitation 
codebook; 

means for correcting an autocorrelation of a synthesis signal synthesized from a codevector of said 
excitation codebook and the linear predictive parameter using an autocorrelation of a synthesis signal 
55 synthesized from a codevector of said adaptive codebook and the linear predictive parameter and a 

cross-correlation between the synthesis signal of the codevector of said adaptive codebook and the 
synthesis signal of the codevector of said excitation codebook; 
and 
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means for searching said gain codebook using the corrected autocorrelation from said correcting 
means and a cross-correlation between a signal obtained by subtraction of the synthesis signal of the 
codevector of said adaptive codebook from the input speech signal and the synthesis signal of the 
codevector of said excitation codebook. 

A speech coding system, comprising: 

a linear predictive analyzer for receiving an input speech signal divided into frames of a fixed 
interval and finding a spectrum parameter of the input speech signal: 

an adaptive codebook representing codevectors; 

an adaptive codebook search circuit for searching said adaptive codebook using the linear 
predictive parameter of said linear predictive analyzer: 

an excitation codebook representing excitation codevectors; 

a gain codebock for quantizing a gain of said adaptive codebook and a gain of said excitation 
codebook; and 

means for s*»a/cning said gain coOebook for a codevector using a normalization coefficient which is 
calculated from an autocorrelation of a synthesis signal of an adaptive codevector from said adaptive 
codebook search circuit, a cross-correlation between a synthesis signal of the adaptive codevector and 
the synthesis signal of said excitation codevector, an autocorrelation of the synthesis signal of the 
excitation codevector, and an autocorrelation of the input speech signal or an estimated value of such 
autocorrelation of the input speech signal. 
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